Apertus is a fully open multilingual large language model developed by Swiss AI, offering two parameter scales of 7 billion and 8 billion. This model supports over 1000 languages, uses fully compliant and open training data, and its performance can rival that of closed-source models. Apertus was pre-trained on 15T tokens and adopts a phased curriculum training method, supporting a context length of up to 65,536 tokens.
Natural Language Processing
TransformersOther